Critical reflexivity in financial markets: a Hawkes process analysis 
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We model the arrival of mid-price changes in the E-Mini S&P futures contract as a self-exciting 
Hawkes process. Using several estimation methods, we find that the Hawkes kernel is power-law with 
a decay exponent close to —1.15 at short times, less than « 10^ seconds, and crosses over to a second 
power-law regime with a larger decay exponent r; —1.45 for longer times scales in the range [10"^, lO''] 
seconds. More importantly, we find that the Hawkes kernel integrates to unity independently of the 
analysed period, from 1998 to 2011. This suggests that markets are and have always been close to 
criticality, challenging a recent study which indicates self-refiexivity (endogeneity) has increased in 
recent years as a resuh of increased automation of trading. However, we note that the scale over 
which market events are correlated has decreased steadily over time with the emergence of higher 
frequency trading. 



I. INTRODUCTION 

The "excess volatility puzzle" is the name coined to 
describe Shiller's [l| and LeRoy and Porter's obser- 
vation that stock prices move too much to be justified by 
changes in subsequent dividends, in contradiction with 
the Efficient Market Hypothesis. Although hotly debated 
ever since among theorists (see Q for a short review), 
many empirical studies have confirmed that a large frac- 
tion of the volatility of asset prices cannot be explained 
by changes in their fundamental value. For example, 
most large price jumps seem to be endogenously gen- 
erated rather than related to the arrival of news in the 
market place . Although excess volatility, crises and 
crashes seem to be as old as financial markets themselves 
0, 3: the debate has recently been stoked by the rise 
of "high frequency trading" which is blamed for having 
made markets more volatile and prone to large magni- 
tude events such as flash-crashes without the need for 
exogenous triggers. Another closely related debate con- 
cerns the financial transaction tax (FTT) as a tool to 
curb the excessive volatility of markets. (Most academic 
studies however suggest that a FTT would in fact in- 
crease volatility Q.) 

Keynes [l(i], Soros [III and others (see e.g. [H, IT3| ) 
have argued that a substantial part of the market ac- 
tivity is of endogenous, "self-refiexive" origin. However, 
real progress has been hobbled by the lack of statistical 
tools that would enable one to define and measure pre- 
cisely the degree of self-reflexivity in financial markets. 
Recently, Filimonov and Sornette (FS) Q have applied 
the so-called self-exciting Hawkes process formalism to 
market events, with the aim of quantifying precisely en- 
dogeneity and exogeneity in market activity. Hawkes pro- 
cesses were originally introduced to model the clustered 
occurrences of earthquakes. Intuitively, earthquakes re- 
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lax accumulated stresses between tectonic plates - but by 
doing so other regions are brought closer to their yield 
stress, leading to an inter-twined pattern of foreshocks 
and aftershocks. Hawkes processes describe the arrival 
of these seismic events as a Poisson point process but 
with a history dependent intensity, that increases when 
a seismic event just occurred. The same picture lends it- 
self naturally to the arrival of events in the market place. 
Not surprisingly, the Hawkes formalism has generated a 
fiurry of activity as a means of modelling volatility and 
market dynamics [T5l - [l8| . 

The results of Filimonov and Sornette are twofold. 
First, their Hawkes process analysis over the period 1998- 
2010 on the E-mini S&P futures contract suggests that 
the degree of self-refiexivity has increased steadily in the 
last decade, an efl'ect they attribute to the increased de- 
ployment of high-frequency and algorithmic trading. Sec- 
ond, when calibrated over much shorter time intervals (10 
minutes), the Hawkes process analysis is found to be able 
to detect precursors of the flash-crash that happened on 
May 6th, 2010. Devising such early warning tools is ev- 
idently beneficial (in particular for market regulators). 
The work of FS is, in this respect, in a similar vein to 
that of Easley, O'Hara and Lopez de Prado who devel- 
oped their "VPIN" indicator [l9|. 

The aim of the present paper is to revisit the first claim 
of FS that concerns the secular increase of self-refiexivity 
in financial markets. (Although for the moment we re- 
serve judgement on the second claim, which is of course 
of high interest as well). Our results are markedly dif- 
ferent from those of FS. We find that the degree of self- 
refiexivity has been extremely stable over time and in 
fact equal to the critical value beyond which the cali- 
brated Hawkes process becomes ill-defined. We also find, 
in agreement with the recent work of Bacry et al. [Tt} , 
that the "infiuence kernel" of the Hawkes process (see 
below for a precise definition) decays as a power-law in 
time, with two different regimes and a crossover time 
of around five minutes. This is the main explanation for 
the discrepancy between our own analysis and that of FS, 
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who chose an exponentially decaying influence kernel. 

We consider our result very plausible as they fit well 
with results previously reported in the literature. For 
example, as we show in the next section, the power-law 
decay of the influence kernel is related to the long-range 
memory of the market activity, which is one of the best 
documented stylized facts of financial markets. Further- 
more, that markets operate in the vicinity of a criti- 
cal point has been repeatedly ernphasized in the econo- 
physics literature, see e.g. [1, [H, [20l - l25| . Within a 
Hawkes process description, criticality is related to both 
long-range memory and the prevalence of endogeneity 
over exogeneity in a precise manner. Finally, many stud- 
ies conclude that the statistics of financial markets have 
been remarkably stable over time and assets (see e.g. 
[T3I [26j and refs. therein) - a result that would have 
been hard to reconcile with the strong increase of self- 
refiexivity reported by FS. 



II. HAWKES PROCESSES: DEFINITION AND 
SOME USEFUL RESULTS 

A. The general case 

A one-dimensional Hawkes process is a non- 
homogeneous Poisson process N{t) with an intensity 



X{t) = E described by a constant base intensity 

/i and a self-exciting term that is a weighted sum over 
previous events: 



\{t) (l){t-s)dN{s) 



(1) 



The "influence kernel" (/)(r) > describes the effect on 
the instantaneous event rate X{t) of a past event that took 
place at time t — t. Assuming stationarity, and taking 
the expectation of both sides of Eq. (P) allows one to 
compute the average intensity A, which is the solution 
of: 



A 



A 



{T)dT. 



(2) 



Defining n = (f>{T)dT, the above equation only has a 
meaningful solution provided n < 1, in which case A — 
— n) > ji. When n = 0, the Hawkes process is 
a homogeneous Poisson process with intensity /j,. For 
n > 1, the intensity of the process defined by Eq. ([T]) 
explodes in finite time. The critical case n = 1 is special 
and will be discussed below. Hawkes processes can be 
mapped into the well known branching processes [27j : 
exogenous "mother" events occur with intensity /z, and 
give rise to endogenous "child" events, which themselves 
give birth to more child events. The quantity n is the 
branching ratio of the process, i.e. the expected number 
of additional events that any single event triggers. As 
is well known, branching processes become critical when 



n reaches unity, and give rise to infinite families with 
strictly positive probability when n > 1. 

The functional form of (^(r) further defines the type of 
Hawkes process. A popular form, chosen by FS in their 
analysis of market data, is the exponential kernel: 



{r) 
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Note that defines the memory time of the Hawkes 
process, beyond which past events have a negligible in- 
fiuence on the instantaneous event rate. However, other 
forms are possible too. For example, a power-law kernel 
with a short-time cutoff tq: 



(t) = e(T - To)(po- 



e > 0, 



(4) 



where 0(.) is the Heaviside function. In this case, the 
branching ratio is n = 'fo/e- 

A quantity that can easily be measured empirically is 
the auto-covariance of the process, defined as: 



dN{s) - Ah 




T+h 



dN{s) - Ah 



(5) 

where h is some small time interval. 

Bacry et al. showed that in the small frequency limit, 
the Fourier transforms of v and 4> are related through the 
following equation [iTj : 



;>(w) = A 



1 



1 - cb{uj) 



(6) 



Letting w = above leads to: 



v{T)dT = A 



1-n 



n = 1 — 



A 



2j^v{T)dT 

(7) 

Now the autocorrelation function can either decay faster 
than for large r, corresponding to a short-memory 
process for which v{T)dT < 00, or slower than 
corresponding to a long-memory process for which now 
v{T)dT is infinite. From the above equation, one 
therefore concludes that long-memory of the autocorre- 
lation necessarily implies criticality, i.e. n — 1. 



B. The critical case 

Let us study this critical case in more detail. Assuming 
that the Hawkes kernel is a power-law as above, with 
an exponent < e < 1, one finds that the small wtq 
expansion of reads: 



;) = n — A{ujToy 



(8) 



where A is a complex number, related to ipo and e. (Note 
that when e > 1, the first term of the expansion of 



3 



(j) becomes regular, i.e. cx w.) Similarly, for an auto- 
covariance v{t) which decays asymptotically as a power- 
law oc T^" for some < a < 1, one finds: 

i>{lu) = Buj°'-^ (9) 

with some real constant B. Note that a < 1 corresponds 
to long-range memory, and i'{uj 0) ^ oo, leading to 
n = 1 as alluded to above. Inserting these expressions 
into Eq. ([6]), we obtain, for a; — 0, 

Bio"-^ « \A\-^Lj^^', (10) 

leading to: 

a = l-2e, B^\A\-^ (11) 

The above relation implies that e must in fact lie in the 
restricted interval ]0, ^[ for a to be in the range ]0, 1[, 
as assumed. But since A = /i/(l — n), the limiting case 
n = 1 only makes sense if simultaneously /i = 0, i.e. the 
total number of exogenous events grows sub-linearly with 
the sample length. Interes ting ly, the above construction 
has been made rigorous in [28[ in their study of "Hawkes 
processes without ancestors" . 

Another way to think about this limit, which may be 
more appropriate for financial applications, is to assume 
that n = l — 5, 5 <^\. The autocorrelation function then 
decays as a power-law oc r^" up to a (long) crossover 
time r* ^ (5~^/^'^, beyond which it decays much faster, in 
order to make v{t) integrable. 

What happens when e > ^ but lpq — € such that n — 1 
(see Eq. (HI))? Naively, the above analysis predicts a < 0, 
i.e. an autocorrelation that increases without bound as 
T ^ oo. This is of course impossible, and signals that in 
this case, the critical Hawkes process is non stationary, to 
wit, the average instantaneous rate increases with time. 
This means that an initial time is needed to make the 
process well defined. 

The reason we have focused so much attention on this 
critical case is that, as we will show below, the empirical 
data clearly suggests that the relevant Hawkes process is 
indeed critical and stationary, i.e. precisely the Hawkes 
processes without ancestors considered by Bremaud & 
Massoulie [H]. 

III. POWER-LAW VERSUS EXPONENTIAL 
HAWKES KERNEL 

As mentioned above, FS estimated the parameters of a 
Hawkes processes on the midpoint changes of the E-mini 
S&P Futures contract in the period 1998-2010 assuming 
an exponential kernel. However, this choice can only be 
justified if the dynamics of the market can be charac- 
terized by a single time scale, This is at variance 
with many empirical studies that clearly elicit a scale- 
free, long memory of market activity. This strongly sug- 
gest the use of a power-law influence kernel. As a matter 



of fact, Bacry et al. [17j have developed a non-parametric 
estimation technique that yields a power-law kernel over 
several decades for incoming market orders on both the 
Bund and the Dax futures contracts. They find that the 
exponent e (defined by Eq. (|4])) is small. 

We will revisit this non-parametric determination of 
the kernel in Sect. IIVI but for now we wish to repeat the 
analysis of |:14] with the hypothesis of a simple power- 
law kernel, as given by Eq. (j?]). To allow for changes to 
the high frequency end of the Hawkes kernel over time 
resulting from improved temporal resolution of market 
events and reduced trading latency, we consider a power- 
law kernel with smooth exponential cut-off for short lags. 
Furthermore, to expedite the calculation of the likelihood 
of the Hawkes process, we approximate the power-law as 
a sum of exponential functions with power- law weights ^ . 



(/)(rl/x,n,e,To) = 




(12) 

The scales of the exponentials are given by: 

6 = Tom* (13) 

and S and Z are constants chosen such that (/)(0) — 
and (j){T)dT = n. For the empirical calibration, we 
have iixed M = 15 and m = 5. For values of e close to 
zero as we expect, the resulting function describes closely 
a power-law form with tail exponent r^(^+'^-' but the 
negative exponential term provides a smooth cut-off at 
short lags. 

Given observations of empirical market events (mid- 
point changes) at times ti, ■ • • , in an interval [0, T] 
we can fit the model by maximising the log-likelihood 
pol HO] over the set of parameters 6 — (/i, n, e, h). 

\ogL{ti,t2,...,t^\e) = ~ [ \{t\e)dt+j log \{t\e)dN{t) 

Jq Jq 

(14) 

The data we use consists of a sequence of times in the 
period 1998-2011 when the mid-point of the bid and ask 
price of the E-mini S&P contract was observed to change. 
Changes in this mid-point may have been the result of 
executed market orders or limit orders that have been 
placed or cancelled. Our data is time-stamped to the 
millisecond, but since sometimes multiple events occur 
during the same millisecond - the limit of our temporal 
resolution - we employ a randomisation procedure. We 
give each event a real number time-stamp distributed 
randomly and uniformly within the millisecond that it 
was reported. A similar operation is performed in [l^ . 
but within the second resolution that they have. 



By approximating with a sum of exponentials we can take advan- 
tage of a recurrence relation which reduces the time complexity 
of the log-likelihood calculation in Eq. (O from 0{N^) to 0{N) 
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FIG. 1: Results for the MLE fitting of the Hawkes process with power-law kernel to mid-point changes in the E-mini S&P 
Futures contract. The fit is performed over two-month periods between Jan. 1998 and Dec. 2011. Top-Left: The branching 
ratio n appears to change little and fluctuates around the critical value n = 1 for the 14-year period studied. Top-Right: The 
high frequency cut-off tq, on the other hand, decreases exponentially over time as a result of the increasing frequency at which 
events are correlated, this trend is not too far out of line with Moore's law, the oft-cited observation that the processing speed 
of computers doubles every 18 months. Bottom-Left: The base intensity fi (in events per second) that has remained roughly 
constant over time, apart from spikes during crisis periods. Bottom-Right: The exponent e entering the power-law kernel <^(r), 
that has also remained roughly constant around 0.15 - see below. 



In fitting the model we only consider market events oc- 
curring during Regular Trading Hours (09:30 to 16:15). 
Furthermore, we take account of the 'U-shape' intra-day 
activity profile. Without appropriate detrending, the 
fitting procedure would interpret the slow variation in 
event rate during the day as arising from the shape of 
the Hawkes kernel. To this end, we consider a detrended 
Hawkes process model 



Km 



1 



wit) 



w{s)(t){t~s\0)dN{s) 



(15) 



where w{t) is a periodic function defined over the trad- 
ing day that dictates the weight to be appropriated to 
events occurring at different hours. The aim is to give 
less weight to events occurring during the morning and 
late afternoon when high activity is to be expected. The 
weights are chosen to be the reciprocal of the empiri- 
cal average daily event rate at that moment of the day 
w{t) = normahsed such that E[R(t)] = E[w{t)] = 1. 

The model parameters as a function of time are cal- 
culated between 1998 and 2011 on non overlapping two- 
month periods using an estimate of the function R{t) 
made for each year. All regular trading hours in each 



two-month period are concatenated to form a single con- 
tinuous point-process for which the log-likelihood of Eq. 
(IT4l) and Eq. (fT5|) is maximised with respect to the four 
parameters 6 = {/i, n, e, tq}. The results are shown in 

Fig.m 

Our key observation is that the branching ratio ap- 
pears to change little over the course of the fourteen years 
studied but fluctuates about the critical value nc = 1 cor- 
responding to the onset of stationarity. This means, that 
within the Hawkes framework, the majority of events ob- 
served in the market can be attributed to the long-range 
influence kernel, both now and in 1998. However, we 
see that the short lag cut-off of the kernel has decreased 
exponentially over time. This can be attributed to the 
steady emergence of higher frequency trading despite lit- 
tle change in the event rate. It may surprise the reader 
that the event rate has not increased signiflcantly in the 
14-year period (the event rate, for example in 2009 is 
comparable with that in 1999) despite the growth of the 
electronic futures market, but understand that the events 
studied are mid-point changes, not trades or order-book 
events and as such it is a better proxy for volatility than 
transaction quantity or volume. 
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Note that for the stationary critical Hawkes process 
with n = 1 described in Sect. IIIBi the base intensity 
/i must be zero. The small "excess" base intensity ob- 
served in our results can be explained as arising from 
the contribution of the part of the power-law kernel that 
extends beyond the two-month window on which the fit 
is performed. The integral of this remainder is approxi- 
mately e (■^) where T w 2 months. Indeed, since each 
two month period is fitted independently, without prior 
event history, there must be some non-zero base intensity 
to explain the observation of any event at all! 



in the same way as FS. The resulting branching ratio ^ 
was then averaged over all 100 windows in each ensem- 
ble. The results of this analysis are shown in Fig. [21 and 
resemble much the data presented in FS (see their Fig. 
4, panel D). Despite the fact that the dataset is gener- 
ated from a critical Hawkes process with n = 1.0, the 
analysis with the exponential kernel model on 30-minute 
windows reveals a sub-critical branching ratio which in- 
creases steadily over time. 



IV. NON-PARAMETRIC ESTIMATION OF THE 
HAWKES KERNEL 
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FIG. 2: The analysis of [1J| on a critical power-law Hawkes 
processes. We generate synthetic power-law Hawkes processes 
with parameters {fj,,e,n} — {0.02,0.15, 1.0} but with a cut- 
off To drawn from Fig. [1] and attempt to fit these generated 
processes to a model with an exponential kernel. Furthermore 
we also impose the intra-second randomisation procedure em- 
ployed in [14| . Despite the fact that the generated datasets 
represent critical Hawkes processes with n = 1.0, the analysis 
(incorrectly) reveals a sub-critical branching ratio which has 
been steadily increasing since 1998, as reported in [l4[. Fig. 
4-D. 

Why is our result n « 1 over the whole period so differ- 
ent from the one presented in FS [l3|? To reconcile our 
result with theirs, we propose that a sub-critical branch- 
ing ratio n < 1 which increases over time can arise from 
the inadequate choice of an exponential short-memory 
model to fit a critical, power-law Hawkes process with a 
time dependent high-frequency cut-off tq. To this end, we 
have performed MLE fits of the exponential kernel model 
to synthetic data that we generated using the power-law 
kernel hypothesis. We fixed {fi,e,n} = {0.02,0.15,1.0} 
(values which are approximately representative of the full 
fourteen years studied) but allowed the cut-off parameter 
To to vary according to the MLE results of Fig. [TJ For 
each two-month time period, we generated an ensemble 
of 100 power-law Hawkes processes from which, follow- 
ing FS's procedure, we drew a 30-minute window. After 
randomising the time-stamps within the second, as in 
14| . the window was fit to the exponential kernel model 



To complement and back up the results presented in 
Section IIIII we also estimate the shape of the Hawkes 
kernel for several times in the 14-year period using non- 
parametric means. In jlTj , the authors present a method- 
ology for estimating the kernel of multivariate Hawkes 
process based on Eq. ^ and the determination of the 
empirical auto-covariance of the event rate, Eq. ([SJ. We 
have repeated their analysis in the monovariate case for 
our data set, choosing h = 0.001s for 2009 (the maximum 
temporal resolution of our data}, h = 0.01s for 2005 and 
h — O.ls for 1998. Following [l3l the auto-covariance is 
also discretely sampled at a rate A — h and the Fourier 
transform is performed in practice with DFT. The result- 
ing kernel estimate is subsequently averaged with loga- 
rithmic bins to reduce noise for presentation. 

To minimise the effects of intra-day seasonality, event 
rates are again detrended by dividing by the daily event 
rate as averaged for 5- minute bins over each year studied. 
Furthermore the empirical auto-covariance is estimated 
on 45-minute windows, during which we can expect the 
event rate to be relatively stationary anyway. The auto- 
covariance estimate used to produce the kernel in the 
range 1 ms - 1000s in Fig. [3] is based on an average over 
all such 45-minute windows in a full year (1998, 2006 or 
2009). 

The non-parametric procedure confirms the MLE re- 
sults and also captures the change in the high-frequency 
cut-off between the two years. Note that the strong os- 
cillations in the 1998 kernel estimate for r < Is are not 
random noise, but the result of a spurious correlation in 
how the data (although time-stamped to the millisecond) 
is collected or aggregated. However, the MLE estimation 
of the kernel shape seems robust to these data anomalies. 

As an additional check of the goodness of fit of the 
power-law Hawkes process, we consider the residual pro- 
cess. We re-express the point process in terms of a trans- 
formed time t t* which is the integral of the condi- 
tional intensity. 



X{s)ds 



(16) 



where A(.) is calculated using the model on events prior 
to time t. If the Hawkes process with our chosen kernel, 
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FIG. 3: Non-parametric kernel estimation of the Hawkes ker- 
nel in 1998, 2006 and 2009. The power-law form with decay 
exponent e « 0.15 is reproduced in all three years. However 
the high frequency cut-off of the kernels are considerably dif- 
ferent. The kernel estimates match well with the MLE fits 
(solid lines) using the average parameter estimates for each 
year as taken from Fig. [1] 



(/)(e, n, To), is indeed a good description of the data, then 
the transformed process N{t*) should be Poisson with 
unit intensity. We assess this fit by analysis of the arrival 
times between events in the transformed time t* , which 
if unit variance Poisson should be described by the law 
P{At*) =exp(-Ar). 

The results of this analysis for periods in 1998 and 2009 
are given in Fig. El The PDFs presented are produced 
from the inter-arrival times of the residual process for all 
points in the given two month periods. The kernel pa- 
rameters used in the analysis are extracted from the MLE 
results of Fig. [TJ The Kolmogorov-Smirnov distance (i.e. 
the maximum deviation between the empirical CDF and 
the Poisson expectation) is 0.011 in 1998 and 0.031 in 
2009. 



Jan-Feb 1998 



Jan-Feb 2009 




FIG. 4: The empirical distribution for the inter-arrival times 
of events in the transformed time of Eq. p6[) . The parameters 
for the power-law kernel used are extracted from the MLE 
results of Fig. [1] 



V. THE SHAPE OF THE HAWKES KERNEL AT 
LARGE LAGS 

The results in the previous section demonstrate that 
a Hawkes process described by a power-law kernel with 
an exponent e « 0.15 is indeed a good fit to the data in 
the millisecond to 5-minute region. We will now show 
that there is a marked regime change in the nature of 
the correlations for larger time horizons. To see this, we 
use Detrended Fluctuation Analysis (DFA) ^3^, a com- 
monly used tool for detecting long-range correlations in 
the presence of trends and non-stationarity. More pre- 
cisely, we compute: 



F{L) = 



\ 



(17) 



where N(t) is the empirical event counting process 
formed by concatenating all regular trading hour peri- 
ods in 1998-2011. N{t) is discretised into T 0.1 second 
bins and Jl (t) is a piecewise function formed by the lin- 
ear regression to the points N{t) in non-overlapping win- 
dows of length L which cover the period studied. If N{t) 
behaved as a Brownian motion with uncorrelated fluctu- 
ations, as for a simple Poisson process, we would expect 
F{L) ~ with a Hurst exponent H = ^. In the pres- 
ence of long-ranged correlations induced by a power-law 
Hawkes kernel, one finds: 



(18) 



Note that this result is not intuitive, since the larger the 
value of e, the faster the decay of the Hawkes kernel, but 
the stronger the deviation from the Poisson result H — ^. 
This paradoxical behaviour was in fact already apparent 
in the relation q = 1 — 2e derived in Sect. Ill Bl above. 

Fig. [5] clearly shows two regions. For small time win- 
dows {L < 1000s), the Hurst exponent is found to be sa 
0.63, consistent with the observation of e « 0.15 in the 
previous section. For longer time windows (L > 1000s), 
the diffusion of the integrated event rate becomes more 
super-diffusive and leads to a stronger Hurst exponent 
of « 0.95. Such a change to a more super-diffusive 
regime has been observed before for measures of market 
volatility [H, HI] . However these previous studies have 
reported the cross-over at the scale of one day, whereas 
in our dataset for event rate it is happening much faster 
(10 - 20 minutes). We have repeated the same analysis 
for absolute value of five-minute returns and confirmed 
that the cross-over time is indeed longer in that case, on 
the scale of a day. We attribute the difference to the fact 
that the distribution of five-minute returns is well known 
to have fat-tails, much fatter than the distribution of the 
number of price changes over five minutes. The extra 
noise brought about by these tails reduce short-time cor- 
relations, and therefore bring the value of H closer to 
1/2. 
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The critical power-law Hawkes kernel implied by the 
stronger autocorrelation H « 0.95 corresponds to a 
larger decay exponent of e w 0.45. To visualise the shape 
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FIG. 5: Detrended Fluctuation Analysis on the counting pro- 
cess for market events in the period 1998-2011. We identify 
two regimes, one at high frequencies corresponding to a Hurst 
exponent of iif ~ 0.63 and one at much lower frequencies cor- 
responding to ~ 0.95. 




Time lag x (minutes) 



FIG. 6: The result of non-parametric kernel estimation on 
the auto-covariance of event rate as calculated using 5-minute 
bins for the full 14-year period. We plot 1 — $(r) = 1 — 
(l){s)ds in a log-log representation in the main hgure and 
"I>(r) as a function of r"'^ '*^ in the inset, clearly demonstrating 
that <;/>(«) integrates to unity and has a power-law tail. This 
tail exponent e — 0.45 is consistent with the DFA analysis in 
Fig. [5] 



of the Hawkes kernel at these large time scales, we es- 
timate the auto-covariance of event rate but now using 
5-minute binned data over the full 14-year period. To do 
this, we again treat the event rate dataset as one large 
continuous time series by concatenating all regular trad- 
ing periods. The resulting kernel estimate is shown in 
Fig. ini Two results can be read from this figure: a) 
again, the Hawkes kernel indeed integrates to unity, con- 
firming the critical nature of the process implied by the 
long-memory of event rates (as discussed in Sect. |lll); b) 
the exponent e = 0.45 extracted from the DFA analysis 
correctly describes the time dependence of the Hawkes 
kernel in the region ^ 1000 seconds 10^ seconds (40 
days). The value e = 0.45 leads to a ~ 0.1, which is 
indeed similar to the value of the exponent for the decay 
of volatility correlations found in the literature (see e.g. 

MM)- 

VI. CONCLUSION 

In this study, we have confirmed that Hawkes pro- 
cesses provide a useful framework to describe the high- 
frequency activity in financial markets. However, con- 
trary to Filimonov &; Sornette [14], but in agreement 
with Bacry et al. llTj], we have found that the Hawkes 
kernel (that describes how past events influence future 
activity) decays as a power-law, rather than exponen- 
tially. The branching ratio we extract from the cali- 
bration of our model to the mid-point changes in the 
E-mini S&P over 14 years is close to critical and has 



remained constant over time, contrary to the claim of 
Filimonov & Sornette that self-reflexivity has substan- 
tially increased since 1998. Nonetheless, we uncover that 
the short lag cut-off of the power-law kernel which we 
fit has significantly decreased over time, with the emer- 
gence of higher frequency trading and higher temporal 
resolution in the market. We have argued that this ex- 
plains the discrepancy between our results and those of 
Filimonov & Sornette. Using both a non-parametric 
method and Detrended Fluctuation Analysis, we have 
found that the Hawkes kernel must in fact be described 
by two power-laws: one decaying as « ^--i is for short 
time scales (r < 1000 seconds), crossing over to a faster 
decay w -r"^ "*^ for longer time scales. This long time 
regime is consistent with the slow decay of volatility cor- 
relations reported in the literature. 

The picture which emerges from our analysis is one in 
which bursts of diverging trading activity are just as in- 
evitable now as they were in 1998 (and probably as far 
back as financial markets existed) , but that the time-scale 
over which they may occur has substantially shortened in 
the last decade. We have shown that the Hawkes kernel 
must indeed be critical whenever the event arrival is a 
long-memory process. Based on this observation and on 
the results of Bacry et al. [l3| on the Bund and the DAX 
futures, we conjecture that our critical market scenario 
is common to most actively traded markets. Interest- 
ingly, financial markets appear to be well described by a 
critical Hawkes process "without ancestors", a theoreti- 
cal scenario put forth by Bremaud & Massoulie [1^. In 
non-technical terms, this means that the rate of endoge- 
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nous events in financial markets is overwhelming com- 
pared to that of exogenous (news related) events. A de- 
tailed mechanism explaining why markets are precisely 
poised at criticality is however still lacking. 
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